Real Time Loading of Enterprise Data Using Fragmentation of Data Warehouses
نویسندگان
چکیده
Real-time ETL (Extraction, Transformation and Loading) of enterprise data is one of the foremost features of next generation Business Intelligence (BI 2.0). This paper presents a proposal for loading operational data in real time using a Data Warehouse (DW) architecture with faster processing time than current approaches. Distributed database techniques, like derived horizontal fragmentation in shared nothing architecture, are used to create fragments that are specialized in most current data and optimized to achieve continuous insertions. Using this approach, the DW can be updated near-line from operational data sources. As a result, DW queries are executed over real time data or very close to that. Moreover, continuous loadings do not impact queries response time. In addition, we extended the Star Schema Benchmark to address loading operational data in real time. This benchmark, including the new feature, is used to validate and demonstrate the efficiency of our approach compared to other ones.
منابع مشابه
Striving towards Near Real-Time Data Integration for Data Warehouses
The amount of information available to large-scale enterprises is growing rapidly. While operational systems are designed to meet well-specified (short) response time requirements, the focus of data warehouses is generally the strategic analysis of business data integrated from heterogeneous source systems. The decision making process in traditional data warehouse environments is often delayed ...
متن کاملReal time data loading and OLAP queries: Living together in next generation BI environments
Real time ETL (Extraction, Transformation and Loading) of enterprise data is one of the foremost features of next generation Business Intelligence (BI 2.0). This article presents a proposal for loading operational data in real time using a Data Warehouse (DW) architecture with faster processing time than current approaches. Distributed processing techniques, such as data fragmentation on top of...
متن کاملبهبود فرآیند استخراج، تبدیل و بارگذاری در پایگاه داده تحلیلی با کمک پردازش موازی
Abstract Data Warehouses are used to store data in a structure that facilitates data analysis. The process of Extracting, Transforming, and Loading (ETL) covers the process of retrieving required data from the source system and loading them to the data warehouse. Although the structure of source data (e.g. ER model) and DW (e.g. star schema) are usually specified, there is a clear lack of a ...
متن کاملValuation Factors for the Necessity of Data Persistence in Enterprise Data Warehouses on In-Memory Databases
ETL (extraction, transformation, and loading) and data staging processes in Enterprise Data Warehouses have always been critical due to their consumption of time and resources. Mostly, the staging processes are accompanied with persistent storage of transformed data to enable a reasonable performance when accessing for analysis and other purposes. The persistence of – often redundant – data req...
متن کاملAutomatic Workload Management for Enterprise Data Warehouses
Modern enterprise data warehouses have complex workloads that are notoriously difficult to manage. Additionally, RDBMSs have many “knobs” for managing workloads efficiently. These knobs affect the performance of query workloads in complex interrelated ways and require expert manual attention to change. It often takes a long time for a performance expert to get enough experience with a large war...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011